FAFNet: Fully aligned fusion network for RGBD semantic segmentation based on hierarchical semantic flows
نویسندگان
چکیده
Depth maps are acquirable and irreplaceable geometric information that significantly enhances traditional color images. RGB (RGBD) images have been widely used in various image analysis applications, but they still very limited due to challenges from different modalities misalignment between depth. In this paper, a Fully Aligned Fusion Network (FAFNet) for RGBD semantic segmentation is presented. To improve cross-modality fusion, new fusion block proposed, features depth first fused by an attention cross module then aligned flow. A multi-layer structure also designed hierarchically utilize the block, which not only eases issues of low resolution noises reduces loss upsampling process. Quantitative qualitative evaluations on both NYU-Depth V2 SUN RGB-D dataset demonstrate FAFNet model outperforms state-of-the-art methods.
منابع مشابه
Improving Fully Convolution Network for Semantic Segmentation
Fully Convolution Networks (FCN) have achieved great success in dense prediction tasks including semantic segmentation. In this paper, we start from discussing FCN by understanding its architecture limitations in building a strong segmentation network. Next, we present our Improved Fully Convolution Network (IFCN). In contrast to FCN, IFCN introduces a context network that progressively expands...
متن کاملTraining Bit Fully Convolutional Network for Fast Semantic Segmentation
Fully convolutional neural networks give accurate, per-pixel prediction for input images and have applications like semantic segmentation. However, a typical FCN usually requires lots of floating point computation and large run-time memory, which effectively limits its usability. We propose a method to train Bit Fully Convolution Network (BFCN), a fully convolutional neural network that has low...
متن کاملRGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling
Beyond the success in classification, neural networks have recently shown strong results on pixel-wise prediction tasks like image semantic segmentation on RGBD data. However, the commonly used deconvolutional layers for upsampling intermediate representations to the full-resolution output still show different failure modes, like imprecise segmentation boundaries and label mistakes in particula...
متن کاملDistributed Information Retrieval Based on Hierarchical Semantic Overlay Network
One fundamental problem that confronts information retrieval is to efficiently support query with higher accuracy and less logic hops. This paper presents HSPIR (Hierarchical Semantic P2P-based Information Retrieval) that distributes document indices through the P2P network hierarchically based on documents semantics generated by Latent Semantic Indexing (LSI) [1]. HSPIR uses CAN [2] and Range ...
متن کاملManual Segmentation and Semantic-based Hierarchical Tagging of 3D models
Today 3D objects have become widely available in different application domains, thus it is becoming fundamental to use, integrate and develop techniques for extracting and maintaining their implicit knowledge. These techniques should be encapsulated in intelligent systems able to semantically annotate the 3D models, thus improving their usability and indexing, especially in innovative web coope...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Iet Image Processing
سال: 2022
ISSN: ['1751-9659', '1751-9667']
DOI: https://doi.org/10.1049/ipr2.12614